Add vLLM Support for GPT OSS and its mapping generator for tunix. #2803

abhinavclemson · 2025-12-10T07:19:48Z

Description

Add vLLM Support for GPT OSS and its mapping generator for tunix.

Before submitting this PR, please make sure (put X in square brackets):

I have performed a self-review of my code. For an optional AI review, add the gemini-review label.
I have necessary comments in my code, particularly in hard-to-understand areas.
I have run end-to-end tests and provided workload links above if applicable.
I have made or will make corresponding changes to the doc if needed, including adding new documentation pages to the relevant Table of Contents (toctree directive) as explained in our documentation (https://maxtext.readthedocs.io/en/latest/development.html#adding-new-documentation-files).

richjames0

lgtm

gagika · 2025-12-10T17:34:43Z

src/MaxText/integration/tunix/weight_mapping/gpt_oss.py

+
+      # TODO: Enable multi-host sharding, if there is a mismatch in shapes.
+      # # MULTI-HOST case.
+      val = jax.device_put(val, current.sharding)


Was this tested on multi-host?

With which sharding do you think it might not work?
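
For reference, a minimal sketch of what the `jax.device_put(val, current.sharding)` line in the hunk does, assuming a one-dimensional mesh over all visible devices. The helper name `put_like`, the mesh axis name `model`, and the explicit shape check are illustrative assumptions, not code from this PR:

```python
import jax
import jax.numpy as jnp
import numpy as np
from jax.sharding import Mesh, NamedSharding, PartitionSpec as P


def put_like(val, current):
  """Hypothetical helper: place `val` with the same sharding as `current`."""
  if val.shape != current.shape:
    raise ValueError(f"shape mismatch: {val.shape} vs {current.shape}")
  return jax.device_put(val, current.sharding)


# Build a 1-D mesh over whatever devices are visible (a single CPU device also works).
devices = np.array(jax.devices())
mesh = Mesh(devices, ("model",))
n = devices.size

# `current` stands in for the target parameter, sharded along its last axis.
current = jax.device_put(jnp.zeros((4, 8 * n)), NamedSharding(mesh, P(None, "model")))
val = put_like(jnp.ones((4, 8 * n)), current)

# Number of processes (hosts), total devices, and the resulting sharding.
print(jax.process_count(), jax.device_count(), val.sharding)
```

On a single host this runs as written. Behavior across several processes is exactly what the TODO leaves open, so the sketch makes no claim about it.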

gagika · 2025-12-10T17:36:22Z

src/MaxText/integration/tunix/weight_mapping/gpt_oss.py

+      return current.at[..., 0::2].set(val)
+
+    def fuse_interleaved_up(val, tgt_param):
+      """Fuse Up (wi_1) with Multi-Host Sharding Support."""


Why is multi-host special? Do you mean multi-device or multi-host?

From JAX's perspective, I think only the number of devices (and the sharding) matters.
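
To make the interleaving in the hunk concrete, here is a minimal sketch of the `.at[..., 0::2].set(...)` pattern. It assumes the gate projection (wi_0) fills the even positions of the fused last axis and the up projection (wi_1) fills the odd positions; the function name `fuse_interleaved` and the toy shapes are hypothetical, and nothing here depends on how many hosts are involved:

```python
import jax.numpy as jnp


def fuse_interleaved(gate, up):
  """Hypothetical helper: interleave two arrays along the last axis."""
  fused_shape = gate.shape[:-1] + (2 * gate.shape[-1],)
  fused = jnp.zeros(fused_shape, dtype=gate.dtype)
  fused = fused.at[..., 0::2].set(gate)  # even slots (assumed: gate / wi_0)
  fused = fused.at[..., 1::2].set(up)    # odd slots (assumed: up / wi_1)
  return fused


gate = jnp.arange(6.0).reshape(2, 3)
up = -gate
fused = fuse_interleaved(gate, up)
```

Because `.at[...].set(...)` returns a new array rather than mutating in place, each update has to be assigned back, as the sketch does.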

gagika · 2025-12-10T17:42:51Z

src/MaxText/integration/tunix/utils.py

-      return STANDALONE_VLLM_WEIGHT_MAPPING[self.model_name].to_hf_mapping()
+      mapping_fn = STANDALONE_VLLM_WEIGHT_MAPPING[self.model_name].to_hf_mapping
+      total_num_layers = self.config["num_hidden_layers"]
+      print(f"total_num_layers: {total_num_layers} for model: {self.model_name}")


Could you remove this print or turn it into debug logging (e.g. logging.debug)?
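
A minimal sketch of the suggested change, assuming the standard-library `logging` module (the project may prefer a different logger). The function name `read_num_layers` and the model name in the usage line are hypothetical:

```python
import logging

logger = logging.getLogger(__name__)


def read_num_layers(config, model_name):
  """Hypothetical helper: read the layer count and log it at debug level."""
  total_num_layers = config["num_hidden_layers"]
  # Lazy %-style arguments avoid building the string when debug logging is off.
  logger.debug("total_num_layers: %s for model: %s", total_num_layers, model_name)
  return total_num_layers


num_layers = read_num_layers({"num_hidden_layers": 24}, "example-model")
```

With the default logging configuration the message is suppressed; it appears only when the logger's level is set to `DEBUG`.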

gagika · 2025-12-10T17:46:53Z

src/MaxText/integration/tunix/weight_mapping/gpt_oss.py

+      # TODO: Enable multi-host sharding, if there is a mismatch in shapes.
+      # # MULTI-HOST case.
+      val = jax.device_put(val, current.sharding)
+      val.block_until_ready()


is this needed?
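
A small sketch of the distinction behind this question, assuming JAX's default asynchronous dispatch: `jax.device_put` returns without waiting for the transfer, and later computations on the result are ordered after it anyway. `block_until_ready()` mainly matters for timing or for surfacing errors at a known point. The array and values are illustrative only:

```python
import jax
import jax.numpy as jnp

x = jnp.arange(8.0)

# Returns immediately; the transfer may still be in flight.
y = jax.device_put(x, jax.devices()[0])

# Using `y` in a computation is already ordered after the transfer.
z = y * 2

# Explicit synchronization point, e.g. before stopping a timer.
z.block_until_ready()
```

Whether the call is needed in this PR is not something the sketch settles.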

abhinavclemson requested review from A9isha, NicoGrande, NuojCheng, RissyRan, SurbhiJainUSC, aireenmei, bvandermoon, gagika, gobbleturk, hengtaoguo, jiangjy1982, khatwanimohit, richjames0, shralex, suexu1025 and vipannalla as code owners December 10, 2025 07:19

abhinavclemson force-pushed the gpt-off-branch branch 3 times, most recently from 69d6317 to 9b0fe8f Compare December 10, 2025 07:37

Add GPT OSS MaxText to vLLM mappings and helper functions.

413bfab

abhinavclemson force-pushed the gpt-off-branch branch from 9b0fe8f to 413bfab Compare December 10, 2025 07:40

richjames0 approved these changes Dec 10, 2025

gagika reviewed Dec 10, 2025
